5 research outputs found

    Enhancing Wind Power Forecast Precision via Multi-head Attention Transformer: An Investigation on Single-step and Multi-step Forecasting

    Full text link
    The main objective of this study is to propose an enhanced wind power forecasting (EWPF) transformer model for handling power grid operations and boosting power market competition. It helps reliable large-scale integration of wind power relies in large part on accurate wind power forecasting (WPF). The proposed model is evaluated for single-step and multi-step WPF, and compared with gated recurrent unit (GRU) and long short-term memory (LSTM) models on a wind power dataset. The results of the study indicate that the proposed EWPF transformer model outperforms conventional recurrent neural network (RNN) models in terms of time-series forecasting accuracy. In particular, the results reveal a minimum performance improvement of 5% and a maximum of 20% compared to LSTM and GRU. These results indicate that the EWPF transformer model provides a promising alternative for wind power forecasting and has the potential to significantly improve the precision of WPF. The findings of this study have implications for energy producers and researchers in the field of WPF.Comment: This paper is accepted in IJCNN2

    Unlocking the capabilities of explainable fewshot learning in remote sensing

    Full text link
    Recent advancements have significantly improved the efficiency and effectiveness of deep learning methods for imagebased remote sensing tasks. However, the requirement for large amounts of labeled data can limit the applicability of deep neural networks to existing remote sensing datasets. To overcome this challenge, fewshot learning has emerged as a valuable approach for enabling learning with limited data. While previous research has evaluated the effectiveness of fewshot learning methods on satellite based datasets, little attention has been paid to exploring the applications of these methods to datasets obtained from UAVs, which are increasingly used in remote sensing studies. In this review, we provide an up to date overview of both existing and newly proposed fewshot classification techniques, along with appropriate datasets that are used for both satellite based and UAV based data. Our systematic approach demonstrates that fewshot learning can effectively adapt to the broader and more diverse perspectives that UAVbased platforms can provide. We also evaluate some SOTA fewshot approaches on a UAV disaster scene classification dataset, yielding promising results. We emphasize the importance of integrating XAI techniques like attention maps and prototype analysis to increase the transparency, accountability, and trustworthiness of fewshot models for remote sensing. Key challenges and future research directions are identified, including tailored fewshot methods for UAVs, extending to unseen tasks like segmentation, and developing optimized XAI techniques suited for fewshot remote sensing problems. This review aims to provide researchers and practitioners with an improved understanding of fewshot learnings capabilities and limitations in remote sensing, while highlighting open problems to guide future progress in efficient, reliable, and interpretable fewshot methods.Comment: Under review, once the paper is accepted, the copyright will be transferred to the corresponding journa

    WATT-EffNet: A Lightweight and Accurate Model for Classifying Aerial Disaster Images

    Full text link
    Incorporating deep learning (DL) classification models into unmanned aerial vehicles (UAVs) can significantly augment search-and-rescue operations and disaster management efforts. In such critical situations, the UAV's ability to promptly comprehend the crisis and optimally utilize its limited power and processing resources to narrow down search areas is crucial. Therefore, developing an efficient and lightweight method for scene classification is of utmost importance. However, current approaches tend to prioritize accuracy on benchmark datasets at the expense of computational efficiency. To address this shortcoming, we introduce the Wider ATTENTION EfficientNet (WATT-EffNet), a novel method that achieves higher accuracy with a more lightweight architecture compared to the baseline EfficientNet. The WATT-EffNet leverages width-wise incremental feature modules and attention mechanisms over width-wise features to ensure the network structure remains lightweight. We evaluate our method on a UAV-based aerial disaster image classification dataset and demonstrate that it outperforms the baseline by up to 15 times in terms of classification accuracy and 38.3%38.3\% in terms of computing efficiency as measured by Floating Point Operations per second (FLOPs). Additionally, we conduct an ablation study to investigate the effect of varying the width of WATT-EffNet on accuracy and computational efficiency. Our code is available at \url{https://github.com/TanmDL/WATT-EffNet}.Comment: This paper is accepted in IEEE Trans. GRS

    Developing Generative Adversarial Networks for Classification and Clustering: Overcoming Class Imbalance and Catastrophic Forgetting

    No full text
    Generative adversarial networks (GAN) have attracted significant attention from the research community due to their superior clustering and classification abilities. However, data imbalance and continual learning settings cause poor GAN’s performance. In class imbalance problems, the thesis focuses on GAN-based novel data augmentation and oversampling strategies. In GAN-based augmentation, a three-player adversarial GAN game method called class-dependent mixture generator spectral GAN (MGSGAN) is introduced. MGSGAN plays an adversarial game with a discriminator and hyper-spectral classifier to improve its performance by considering the actual data and class-conditionals augmentation generated samples. Our experiments revealed that MGSGAN outperforms other leading approaches by a significant margin for hyperspectral image classification. A second method relies on oversampling techniques, where two adversarial oversampling strategies (adversarial oversampling and adversarial data space oversampling) are introduced with a mixtures of generator parameters updated through classifier parameters. A novel latent preserving single generator under the proposed three-player GAN game settings is introduced. We present four strategies covering all possible GAN games that can be played under three-player settings. The methods were competitive to the state-of-the-art GAN-based oversampling methods. GAN-based adversarial continual learning (ACL) is a recently developed iterative process where a shared feature network is played in an adversarial game with task-specific discriminators. Each task-specific network is introduced to handle the catastrophic forgetting of past tasks. As the job grows, the network structure of ACL also grows simultaneously. Therefore, ACL is not scalable. We propose an online Scalable Adversarial Continual Learning (SCALE) framework where the parameter generator transforms shared features into task-specific features. SCALE surpasses popular baseline methods on accuracy and execution time. Finally, in conjunction with a clustering inference network, a uniform prior-based conditional generative model, ClusterGAN, has recently achieved sub-optimal clustering performance. ClusterGAN may fail to generate all the modes due to disentanglement in the clustered inference network that can’t be fully achieved through only generative. SIMI-ClusterGAN is a new method where a prior network is introduced together with three additional losses to increase disentanglement in the clustering inference network. The method has been validated through benchmark datasets in which balanced and imbalanced scenarios have been considered

    WATT-EffNet: a lightweight and accurate model for classifying aerial disaster images

    No full text
    Incorporating deep learning (DL) classification models into unmanned aerial vehicles (UAVs) can significantly augment search-and-rescue operations and disaster management efforts. In such critical situations, the UAV's ability to promptly comprehend the crisis and optimally utilize its limited power and processing resources to narrow down search areas is crucial. Therefore, developing an efficient and lightweight method for scene classification is of utmost importance. However, current approaches tend to prioritize accuracy on benchmark datasets at the expense of computational efficiency. To address this shortcoming, we introduce the Wider ATTENTION EfficientNet (WATT-EffNet), a novel method that achieves higher accuracy with a more lightweight architecture compared to the baseline EfficientNet. The WATT-EffNet leverages width-wise incremental feature modules and attention mechanisms over width-wise features to ensure the network structure remains lightweight. We evaluate our method on a UAV-based aerial disaster image classification dataset and demonstrate that it outperforms the baseline by up to 15 times in terms of classification accuracy and 38.3% in terms of computing efficiency as measured by Floating Point Operations per second (FLOPs). Additionally, we conduct an ablation study to investigate the effect of varying the width of WATT-EffNet on accuracy and computational efficiency. Our code is available at \url{https://github.com/TanmDL/WATT-EffNet}.Civil Aviation Authority of Singapore (CAAS)Nanyang Technological UniversityThis work was supported by the Civil Aviation Authority of Singapore and Nanyang Technological University (NTU) in collaboration with the Air Traffic Management Research Institute
    corecore